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Description 

[0001 ] The present invention relates to a modified DNA-polymerase having reverse transcriptase activity and reduced 
5'-3' exonuclease activity derived from a native polymerase which is obtainable from Carboxydothermus 
s hydrogenoformans. 

Furthermore the invention relates to the field of molecular biology and provides methods for amplifying a DNA segment 
from an RNA template using an enzyme with reverse transcriptase activity (RT-PCR). In another aspect, the invention 
provides a kit for Coupled High Temperature Reverse Transcription and Polymerase Chain Reaction. 
[0002] Heat stable DNA polymerases (EC 2.7.7.7. DNA nucleotidyltransferase, DNA-directed) have been isolated 
10 from numerous thermophilic organisms (for example: Kaledin et al. (1980), Biokhimiya 45, 644-651; Kaledin et al. 
(1981) Biokhimiya 46, 1576-1584; Kaledin et al. (1982) Biokhimiya 47, 1785-1791; Ruttimann et al. (1985) Eur. J. 
Biochem. 149, 41-46; Neuner etal. (1990) Arch. Microbiol. 153, 205-207). 

For some organisms, the polymerase gene has been cloned and expressed (Lawyer et al. (1989) J. Biol. Chem. 264, 
6427-6437; Engelke et al. (1990) Anal. Biochem. 191, 396-400; Lundberg et al. (1991) Gene 108, 1-6; Perler et al. 

15 (1 992) Proc. Natl. Acad. Sci. USA 89, 5577-5581). 

[0003] Thermophilic DNA polymerases are increasingly becoming important tools for use in molecular biology and 
there is growing interest in finding new polymerases which have more suitable properties and activities for use in diag- 
nostic detection of RNA and DNA, gene cloning and DNA sequencing. At present, the thermophilic DNA polymerases 
mostly used for these purposes are from Thermus species like Taq polymerase from T. aquaticus (Brock et al. (1969) 

20 J. Bacteriol. 98, 289-297) 

[0004] The term "reverse transcriptase" describes a class of polymerases characterized as RNA-dependent DNA- 
polymerases. All known reverse transcriptases require a primer to synthesize a DNA-transcript from an RNA template. 
Historically, reverse transcriptase has been used primarily to transcribe mRNA into cDNA which can then be cloned into 
a vector for further manipulation. 

25 [0005] Reverse transcription is commonly performed with viral reverse transcriptases like the enzymes isolated from 
Avian myeloblastosis virus or Moloney murine leukemia virus. Both enzymes mentioned are active in the presence of 
magnesium ions but have the disadvantages to possess RNase H-activity. which destroys the template RNA during the 
reverse transcription reaction and have a temperature optimum at 42°C or 37°C, respectively. Avian myoblastosis virus 
(AMV) reverse transcriptase was the first widely used RNA-dependent DNA-polymerase (Verma (1977) Biochem. Bio- 

30 phys. Acta 473, 1). The enzyme has 5'-3' RNA-directed DNA polymerase activity, 5'-3' DNA directed DNA polymerase 
activity, and RNaseH activity. RNaseH is a processive 5'-3' ribonuclease specific for the RNA strand of RNA-DNA 
hybrids (Perbal (1984), A Practical Guide to Molecular Cloning, Wiley & Sons New York). Errors in transcription cannot 
be corrected because known viral reverse transcriptases lack the 3'-5' exonuclease activity necessary for proofreading 
(Saunders and Saunders (1987) Microbial Genetics Applied to Biotechnology, Croom Helm, London). A detailed study 

35 of the activity of AMV reverse transcriptase and its associated RNaseH activity has been presented by Berger et al., 
(1983) Biochemistry 22, 2365-2372. 

[0006] DNA polymerases isolated from mesophilic microorganisms such as E. coli have been extensively character- 
ized (see, for example, Bessmann et al. (1957) J. Biol. Chem. 233, 171-177 and Buttin and Kornberg (1966) J. Biol. 
Chem. 241 , 5419-5427). E. coli DNA polymerase I (Pol I) is useful for a number of applications including: nick-transla- 
40 tion reactions, DNA sequencing, in vitro mutagenesis, second strand cDNA synthesis, polymerase chain reactions 
(PCR), and blunt end formation for linker ligation (Maniatis et al., (1982) Molecular Cloning: A Laboratory Manual Cold 
Spring Harbor, New York). 

[0007] Several laboratories have shown that some polymerases are capable of in vitro reverse transcription of RNA 
(Karkas (1 973) Proc. Nat. Acad. Sci. USA 70, 3834-3838; Gulati et al. (1 974) Proc. Nat. Acad. Sci. USA 71 , 1 035-1 039; 
45 and Wittig and Wittig, (1 978) Nuc. Acids Res. 5, 1 1 65-1 1 78). Gulati et al. found that E. coli Pol I could be used to tran- 
scribe Qp viral RNA using oligo(dT) 10 as a primer. Wittig and Wittig have shown that E.coli Pol I can be used to reverse 
transcribe tRNA that has been enzymatically elongated with oligo(dA). However, as Gulati et al. demonstrated, the 
amount of enzyme required and the small size of cDNA product suggest that the reverse transcriptase activity of E. coli 
Pol I has little practical value. 

so [0008] Alternative methods are described using the reverse transcriptase activity of DNA polymerases of thermophilic 
organisms which are active at higher temperatures. Reverse transcription at higher temperatures is of advantage to 
overcome secondary structures of the RNA template which could result in premature termination of products. Ther- 
mostable DNA polymerases with reverse transcriptase activities are commonly isolated from Thermus species. These 
DNA polymerases however, show reverse transcriptase activity only in the presence of manganese ions. These reac- 

55 tion conditions are suboptimal, because in the presence of manganese ions the polymerase copies the template RNA 
with low fidelity. 

[0009] Another feature of the commonly used reverse transcriptases is that they do not contain 3'-5' exonuclease 
activity. Therefore, misincorporated nucleotides cannot be removed and thus the cDNA copies from the template RNA 
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may contain a significant degree of mutations. 

[0010] One of the known DNA polymerases having high reverse transcriptase activity is obtainable from Thermus 
thermophilus (Tth polymerase) (WO 91/09944). Tth polymerase, as well as Taq polymerase, lacks 3' to 5' exonucleo- 
lytic proofreading activity. This 3* to 5' exonuclease activity is generally considered to be desirable because it allows 

5 removal of disincorporated or unmatched bases in the newly synthesized nucleic acid sequences. Another ther- 
mophilic pol l-type DNA polymerase isolated from Thermotoga maritima (Tma pol) has 3' to 5' exonuclease activity. 
U.S. patent 5,624,833 provides means for isolating and producing Tma polymerase. However, both DNA polymerases, 
Tth as well as Tma polymerase, show reverse transcriptase activity only in the presence of manganese ions. 
[0011] The DNA polymerase of Carboxydothermus hydrogenoformans shows reverse transcription activity in the 

10 presence of magnesium ions and in the substantial absence of manganese ions and can be used to reverse transcribe 
RNA, to detect and amplify (in combination with a thermostable DNA polymerase like Taq) specific sequences of RNA. 
Using DNA polymerase of Carboxydothermus hydrogenoformans polymerase a high specificity of transcription is 
observed with short incubation times. A high specificity is observed using e.g. 5 min of incubation time and 33 units of 
DNA polymerase protein. With longer incubation times also with lower amounts of Carboxydothermus 

15 hydrogenoformans polymerase specific products can be obtained. However an unspecific smear of products is occur- 
ring. These unspecific products might be caused by the 5'-3* exonuclease activity of the polymerase which enables the 
enzyme to cleave the template at secondary structures ( n RNaseH"-actrvity) and to create additional primers which can 
be elongated by the DNA polymerase activity. The thermostable DNA polymerase from Carboxydothermus 
hydrogenoformans has been identified and cloned and is described in the copending European application with the 

20 Application No. 961 15873.0, filed October 03, 1996, and incorporated herein by reference. 

[001 2] In summary, reverse transcriptases as MoMULV-RT or AMV-RT perform reverse transcription in the presence 
of magnesium-ions. However, these enzymes act at temperatures between 37°C and 55°C. Reverse transcription at 
higher temperatures would be desirable because secondary structures can be overcome in the template in order to 
avoid premature termination of the reaction and to assure the production of cDNA without deletions. 

25 Other enzymes e.g. DNA polymerase obtainable from Thermus spec, act as reverse transcriptase at temperatures up 
to 70°C in the presence of manganese ions. These reaction conditions are suboptimal, because in the presence of 
manganese ions the polymerase copies the template RNA with low fidelity and the RNA strand will be degraded. Deg- 
radation of the RNA strand occur's faster in the presence of manganese ions as in the presence of magnesium ions. 
Therefore, if manganese ions are present complexation of the manganese ions (e.g. with EDTA) is required after cDNA 

30 synthesis in order to obtain a higher fidelity during cDNA amplification in the subsequent PCR reaction. 
[001 3] Therefore, it is desirable to develop a reverse transcriptase 

• which acts at higher temperatures to overcome secondary structures in the template to avoid premature termina- 
tion of the reaction and to assure the production of cDNA without deletions 

35 • which is active in the presence of magnesium ions in order to prepare cDNA from RNA templates with higher fidelity 
and 

• which has 3 '-5 '-exonuclease in order to remove misincorporated nucleotides before continuation of DNA synthesis 
and to produce products with low mutation frequency 

• which has a high specificity and produces exclusively or predominantly RT-PCR products derived from specific 
40 primer binding. 

[0014] The present invention addresses these needs and provides a DNA polymerase mutant active at higher tem- 
peratures which has reverse transcriptase activity in the presence of magnesium ions and which has 3 '-5' exonuclease 
activity and reduced or no 5'-3' exonuclease activity. 

45 [0015] It is an object of this invention to provide a polymerase enzyme (EC 2.7.7.7.), characterized in that it has 
reverse transcriptase activity in the presence of magnesium ions as well as in the presence of manganese ions. In a 
further aspect the invention comprises a DNA polymerase having 3 -5 '-exonuclease activity and reduced 5'-3' exonu- 
clease activity. The enzyme according to the invention can be obtained from a polymerase obtainable from Carboxydo- 
thermus hydrogenoformans (Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH, Mascheroder Weg 

so 1b, D-38124 Braunschweig, DSM No. 8979). In a further aspect the invention is directed to a DNA polymerase with 
reduced 5'-3' exonuclease activity having reverse transcriptase activity in the presence of magnesiums ions and in the 
substantial absence of manganese ions. In a further aspect the invention comprises a DNA polymerase having a 
molecular mass of about 64 to 71 kDa as determined by SDS PAGE analysis. The mutant polymerase enzyme with 
reduced 5'-3* exonuclease activity derived from a polymerase obtainable from Carboxidothermus hydrogenoformans is 

55 called hereinafter A Chy Polymerase. In a further aspect the invention comprises a recombinant DNA sequence that 
encodes DNA polymerase activity of the A Chy Polymerase. In a related aspect, the DNA sequence is depicted as SEQ 
ID No. 10 (Figure 1). In a second related aspect the invention comprises a recombinant DNA sequence that encodes 
essentially amino acid residues 1 to 607 (SEQ ID No. 1 1 , Figure 1). In a further aspect the invention comprises a recom- 
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binant DNA plasmid that comprises the DNA sequence of the invention inserted into plasmid vectors and which can be 
used to drive the expression of the A Chy DNA polymerase in a host cell transformed with the plasmid. In a further 
aspect the invention includes a recombinant strain comprising the vector pDS56 carrying the A Chy DNA polymerase 
gene and designated pA 2 -225 AR 4- Tn © E.coli strain XL1 carrying the plasmid PA2.225AR4 was deposited on the Deut- 
5 sche Sammlung von Mikroorganismen und Zellkulturen GmbH, Mascherorder Weg 1b, D-38124 Braunschweig DSM 
No. 1 1 854 (BMTU 7307) is designated E.coli GA1 . 

[0016] In referring to a peptide chain as being comprised of a series of amino acids "substantially or effectively" in 
accordance with a list offering no alternatives within itself, we include within that reference any versions of the peptide 
chain bearing substitutions made to one or more amino acids in such a way that the overall structure and the overall 
10 function of the protein composed of that peptide chain is substantially the same as - or undetectably different to - that 
of the unsubstituted version. For example it is generally possible to exchange alanine and valine without greatly chang- 
ing the properties of the protein, especially if the changed site or sites are at positions not critical to the morphology of 
the folded protein. 

[0017] 3'-5' exonuclease activity is commonly referred as "proofreading" or "editing" activity of DNA polymerases . It 

15 is located in the small domain of the large fragment of Type A polymerases. This activity removes mispaired nucleotides 
from the 3' end of the primer terminus of DNA in the absence of nucleoside triphosphates (Kornberg A. and Baker T. 
A.(1992) DNA Replication W. H. Freemann & Company, New York). This nuclease action is suppressed by deoxynucl- 
eoside triphosphates if they match to the template and can be incorporated into the polymer. 
[0018] The 3'-5' exonuclease activity of the claimed DNA polymerase can be measured as degradation or shortening 

20 of a S'-digoxygenin-labeled oligonucleotide annealed to template DNA in the absence or presence of deoxyribonucleo- 
side triphosphates or on DNA fragments in the absence or presence of deoxyribonucleoside triphosphates. 
[0019] Carboxydothermus hydrogenoformans DNA polymerase is the first DNA polymerase isolated from ther- 
mophilic eubacteria with a higher activity in the presence of magnesium ions than in the presence of manganese ions 
as shown in figure 2. The reverse transcriptase activity in dependence of magnesium is of advantage since the DNA 

25 polymerases synthesize DNA with higher fidelity in the presence of magnesium than in the presence of manganese 
(Beckmann R. A. et al. (1985) Biochemistry 24. 5810-5817; Ricchetti M. and Buc H. (1993) EMBOJ. 12, 387-396). Low 
fidelity DNA synthesis is likely to lead to mutated copies of the original template. In addition, Mn 2+ ions have been impli- 
cated in an increased rate of RNA degradation, particularly at higher temperatures and this can cause the synthesis of 
shortened products in the reverse transcription reaction. 

30 [0020] The DNA sequence (SEQ ID No.: 10) of A Chy polymerase and the derived amino acid sequence (SEQ ID No.: 
1 1) of the enzyme are shown in figure 1 . The molecular weight deduced from the sequence is 70,3 kDa, in SDS poly- 
acrylamide gel electrophoresis however A Chy polymerase has an electrophoretic mobility of approx. 65 kDa. 
[0021] The A Chy DNA Polymerase has reduced 5'-3' - exonuclease activity and has a temperature optimum at 72°C 
and exhibits reverse transcriptase activity at temperatures between 50 °C and 75 °C. 

35 [0022] When using A Chy DNA Polymerase obtainable from Carboxydothermus hydrogenoformans having reduced 
5'-3' - exonuclease activity in RT-PCR as reverse transcriptase with subsequent PCR reaction using Taq-polymerase as 
PCR enzyme a remarkable high sensitivity is achieved (Figure 3). The sensitivity of A Chy DNA Polymerase in RT-PCR 
is higher than the sensitivity of e.g. DNA polymerase from Thermus thermophilus (Tth polymerase) (Example 3, Figure 
4). A Chy DNA Polymerase also exhibits high sensitivity by amplifying a 1.83 kB fragment from total RNA from human 

40 muscle (Figure 5). The error rate of A Chy DNA Polymerase is 1,58 x 10~ 4 mutations per nucleotide per cycle and is 
therewith lower than the error rate of Tth Polymerase which is 2.37 x 10* 4 mutations per nucleotide per cycle. This 
results in higher fidelity of A Chy DNA polymerase in comparison to Tth Polymerase. 

[0023] Carboxydothermus hydrogenoformans was isolated from a hot spring in Kamchatka by V. Svetlichny. A sample 
of C. hydrogenoformans was deposited on the Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH 

45 (DSM) under the terms of the Budapest Treaty and received Accession Number DSM 8979. The thermostable polymer- 
ase isolated from Carboxydothermus hydrogenoformans has a molecular weight of 100 to 105 KDa. The thermostable 
enzyme possesses 5 '-3' polymerase activity, a 3 '-5 - exonuclease activity and a reverse transcriptase-activity which is 
Mg ++ -dependent. The thermostable enzyme may be native or recombinant and may be used for first- and second- 
strand cDNA synthesis, in cDNA cloning, DNA sequencing, DNA labeling and DNA amplification. 

so [0024] For recovering the native protein C.hydrogenoformans may be grown using any suitable technique, such as 
the technique described by Svetlichny et al. (1991) System. Appl. Microbiol. 14, 205-208. After cell growth one pre- 
ferred method for isolation and purification of the enzyme is accomplished using the multi-step process as follows: 
[0025] The cells are thawed, suspended in buffer A (40 mM Tris-HCI, pH 7.5, 0.1 mM EDTA, 7 mM 2-mercaptoethanol. 
0.4 M NaCI, 10 mM Pefabloc) and lysed by twofold passage through a Gaulin homogenizes The raw extract is cleared 

55 by centrifugation, the supernatant dialyzed against buffer B (40 mM Tris-HCI, pH 7.5, 0.1 mM EDTA, 7 mM 2-mercap- 
toethanol, 10 % Glycerol) and brought onto a column filled with Heparin-Sepharose (Pharmacia). In each case the col- 
umns are equilibrated with the starting solvent and after the application of the sample washed with the threefold of its 
volume with this solvent. Elution of the first column is performed with a linear gradient of 0 to 0.5 M NaCI in Buffer B. 
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The fractions showing polymerase activity are pooled and ammonium sulfate is added to a final concentration of 20 %. 
This solution is applied to a hydrophobic column containing Butyl -TSK-Toyopearl (TosoHaas). The column is eluted with 
a falling gradient of 20 to 0 % ammonium sulfate. The pool containing the activity is dialysed and again transferred to a 
column of DEAE-Sepharose (Pharmacia) and eluted with a linear gradient of 0-0.5 M NaCI in buffer B. The fourth col- 

5 umn contains Tris-Acryl-Blue (Biosepra) and is eluted as in the preceding case. Finally the active fractions are dialyzed 
against buffer C (20 mM Tris-HCI. pH 7.5, 0.1 mM EDTA, 7.0 mM 2-mercaptoethanol, 100 mM NaCI, 50 % Glycerol. 
[0026] DNA polymerase activity was measured by incorporation of digoxigenin-labeled dUTP into the synthesized 
DNA and detection and quantification of the incorporated digoxigenin essentially according to the method described in 
HOltke, H.-J.; Sagner, G; Kessler, C. and Schmitz, G. (1992) Biotechniques 12, 104-1 13. The reaction is performed in 

10 a reaction volume of 50 \i\ containing 1 or 2 \i\ of diluted (0.05 U - 0.01 U) DNA polymerase and 50 mM Tris-HCI, pH 
8.5; 12.5 mM (NH 4 ) 2 S0 4 ; 10 mM KCI; 5 mM MgCI 2 ; 10 mM 2-mercaptoethanol; 33 nM dNTPs; 200 ng/ml BSA; 12 ng 
of DNAse l-activated DNA from calf thymus and 0.036 nM digoxigenin-dUTP. 

[0027] The samples are incubated for 30 min. at 72°C, the reaction is stopped by addition of 2 nl 0.5 M EDTA, and 
the tubes placed on ice. After addition of 8 nl 5 M NaCI and 150 of Ethanol (precooled to -20°C) the DNA is precipi- 

15 tated by incubation for 15 min. on ice and pelleted by centrifugation for 10 min at 13000 x rpm and 4°C. The pellet is 
washed with 100 uJ of 70% Ethanol (precooled to -20°C) and 0.2 M NaCI, centrifuged again and dried under vacuum. 
[0028] The pellets are dissolved in 50 jil Tris-EDTA (10 mM/0.1 mM; pH 7.5). 5 nJ of the sample are spotted into a well 
of a nylon membrane bottomed white microwave plate (Pall Filtrationstechnik GmbH, Dreieich, FRG, product no: 
SM045BWP). The DNA is fixed to the membrane by baking for 10 min. at 70°C. The DNA loaded wells are filled with 

20 100 nl of 0.45 jim-filtrated 1 % blocking solution (100 mM maleic acid, 150 mM NaCI, 1 % (w/v) casein, pH 7.5). All fol- 
lowing incubation steps are done at room temperature. After incubation for 2 min. the solution is sucked through the 
membrane with a suitable vacuum manifold at -0.4 bar. After repeating the washing step, the wells are filled with 1 00 nl 
of a 1 :10 000-dilution of Anti -digoxigenin- AP, Fab fragments (Boehringer Mannheim, FRG, no: 1093274) diluted in the 
above blocking solution. After incubation for 2 min. and sucking this step is repeated once. The wells are washed twice 

25 under vacuum with 200 nl each time washing-buffer 1 (100 mM maleic-acid, 150 mM NaCI, 0.3 %(v/v) Tween™ 20, pH 
7.5). After washing another two times under vacuum with 200 each time washing-buffer 2 (10 mM Tris-HCI, 100 mM 
NaCI, 50 mM MgCI 2 , pH 9.5) the wells are incubated for 5 min. with 50 \i\ of CSPD™ (Boehringer Mannheim, no: 
1655884), diluted 1 : 100 in washing-buffer 2, which serves as a chemiluminescent substrate for the alkaline phos- 
phatase. The solution is sucked through the membrane and after 10 min. incubation the RLU/s (Relative Light Unit per 

30 second) are detected in a Luminometer e.g. MicroLumat LB 96 P (EG&G Berthold, Wilbad, FRG). 

[0029] With a serial dilution of Taq DNA polymerase a reference curve is prepared from which the linear range serves 
as a standard for the activity determination of the DNA polymerase to be analyzed. 

[0030] The Determination of reverse transcriptase activity is performed essentially as described for determination of 
DNA polymerase activity except that the reaction mixture consists of the following components: 1 *ig of polydA-(dT) 15 , 
35 33 nM of dTTP, 0.36 jiM of digoxigenin-dUTP, 200 mg/ml BSA, 10 mM Tris-HCI, pH 8.5, 20 mM KCI, 5 mM MgCI 2 , 10 
mM DTE and various amounts of DNA polymerase The incubation temperature used is 50°C. 
[0031 ] Isolation of recombinant DNA polymerase from Carboxydothermus hydrogenoformans may be performed with 
the same protocol or with other commonly used procedures. 

[0032] The production of a recombinant form of Carboxydothermus hydrogenoformans DNA polymerase generally 

40 includes the following steps: chromosomal DNA from Carboxydothermus hydrogenoformans is isolated by treating the 
cells with detergent e.g. SDS and a proteinase e.g. Proteinase K. The solution is extracted with phenol and chloroform 
and the DNA purified by precipitation with ethanol. The DNA is dissolved in Tris/EDTA buffer and the gene encoding the 
DNA polymerase is specifically amplified by the PCR technique using two mixed oligonucleotides (primer 1 and 2). 
These oligonucleotides, described by SEQ ID No.: 1 and SEQ ID No.: 2, were designed on the basis of conserved 

45 regions of family A DNA polymerases as published by Braithwaite D. K. and Ito J. (1 993) Nucl. Acids Res. 21 , 787 - 802. 
The specifically amplified fragment is ligated into an vector, preferably the pCR™ll vector (Invitrogen) and the sequence 
is determined by cycle-sequencing. Complete isolation of the coding region and the flanking sequences of the DNA 
polymerase gene can be performed by restriction fragmentation of the Carboxydothermus hydrogenoformans DNA with 
another restriction enzyme as in the first round of screening and by inverse PCR (Innis et al„ (1990) PCR Protocols; 

so Academic Press, Inc., 219-227). This can be accomplished with synthesized oligonucleotide primers binding at the 
outer DNA sequences of the gene part but in opposite orientation. These oligonucleotides described by SEQ ID Nos. 3 
and 4 , were designed on the basis of the sequences which were determined by sequencing of the first PCR product 
described above. As template DNA from Carboxydothermus hydrogenoformans is used which is cleaved by restriction 
digestion and circularized by contacting with T4 DNA ligase. To isolate the coding region of the entire polymerase gene, 

55 another PCR is performed using primers as shown in SEQ ID Nos. 5 and 6. The complete DNA polymerase gene is 
amplified directly from genomic DNA with primers suitable for introducing ends compatible with the linearized expres- 
sion vector. 
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SEQ ID No. 1: 

Primer 1 : 5-CCN AAY YTN CAR AAY ATH-3 ' 

SEQ ID No. 2: 
5 Primer 2: 5 '-YTC RTC RTG NAC YTG-3 ' 

SEQ ID No. 3: 

Primer 3: 5'-GGG CGA AGA CGC TAT ATT CCT GAG C-3* 

io SEQ ID NO. 4: 

Primer 4: 5*-GAA GCC TTA ATT CAA TCT GGG AAT AAT C-3* 

SEQ ID NO. 5: 

Primer 5: 5'-CGA ATT CAA TCC ATG GGA AAA GTA GTC CTG GTG GAT-3' 

15 

SEQ ID NO. 6: 

Primer 6: 5'-CGA ATT CAA GGA TCC TTA CTT CGC TTC ATA CCA GTT-3' 

[0033] The gene is operably linked to appropriate control sequences for expression in either prokaryotic or eucaryotic 

20 host/vector systems. The vector preferably encodes all functions required for transformation and maintenance in a suit- 
able host, and may encode selectable markers and/or control sequences for polymerase expression. Active recom- 
binant thermostable polymerase can be produced by transformed host cultures either continuously or after induction of 
expression. Active thermostable polymerase can be recovered either from host cells or from the culture media if the pro- 
tein is secreted through the cell membrane. 

25 [0034] The use of a plasmid as an appropriate vector has shown to be advantageously, particularly pDS56 (Stuber, 
D., Matile, H. and Garotta, G. (1990) Immunological Methods, Letkovcs, I. and Pernis, B., eds). The plasmid carrying 
the Carboxydothermus hydrogenoformans DNA polymerase gene is then designated pAR4. 
[0035] According to the present invention the use of the E. coli strain BL21 (DE3) pUBS520 (Brinkmann et al., (1989) 
Gene 85, 109-114) has shown to be advantageously. The E.coli strain BL 21 (DEB) pUBS 520 transformed with the 

30 plasmid pAR4 is then designated AR96 (DSM No 1 1 1 79). 

[0036] The mutant AChy was obtained by deletion of an N-terminal fragment of the recombinant wild type Carboxy- 
dothermus hydrogenoformans DNA polymerase using inverse PGR (Innis et al., (1990) PCR Protocols; Academic 
Press, Inc., p 219-227). The reverse primer used is complementary to the cloning site of the expression vector pDS56 
(Stuber, D. ( Matile, H. and Garotta, G. (1990) Immunological Methods, Letkovcs, I. and Pernis, B., eds.) at the Nco I 

35 restriction site (bases 120-151) and has the sequence: 

SEQ ID No. 7: 

Primer 7: 5'-CGG TAA ACC CAT GGT TAA TTT CTC CTC TTT AAT GAA TTC-3'. 

40 [0037] This primer contains additional 7 bases at the 5* end to ensure a better binding of the Nco I restriction enzyme 
in the subsequent restriction enzyme cleavage. The second (forward) primer was complementary to bases 676-702 of 
the wild type gene and has the sequence: 

SEQ ID No. 8: 

45 Primer 8: 5'-CGG GAA TCC ATG GAA AAG CTT GCC GAA CAC GAA AAT TTA-3*) 

[0038] The forward primer also contained an additional Nco I restriction site and additional 7 bases at the 5' -end. Plas- 
mid pDS56 DNA containing the polymerase-gene of Carboxydothermus hydrogenoformans at the Nco l/BamHI restric- 
tion sites was used as template for PCR. The PCR reaction was performed on the circular plasmid DNA pAR4. The 

so fragment encoding the mutated Carboxydothermus hydrogenoformans DNA polymerase (A Chy) and the vector DNA 
were amplified as linear DNA by PCR using the Expand High Fidelity PCR System (Boehringer Mannheim) according 
to the supplier's specifications (Fig. 7). The length of the gene encoding A Chy is 1821 bp. 
Amplification (PerWn Elmer GenAmp 9600 thermocycler) was carried out with the following conditions: 
2 min 94 °C. (10 sec 94 °C; 30 sec 65 °C; 4 min 68 °C) x 10; (10 sec 94 °C; 30 sec 65 °C; 4 min 68 °C) + cycle elonga- 

55 tion of 20 sec for each cycle) x 20; 7 min 72 °C; 

After PCR the amplified DNA was purified using the High Pure PCR Product Purification Kit (Boehringer Mannheim) 
and digested with Ncol (3U / ng DNA) for 16 h (Boehringer Mannheim) according to the supplier's specifications. 
For extraction with Phenol/Chloroform/lsoamylalcohol (24:24:1) the volume of the sample was raised to 100 »J with TE. 
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After extraction the DNA was precipitated by adding 1/10 volumes of 3M Sodium Acetate, pH 5.2 and 2 volumes of 
EtOH. The DNA was circularized using the Rapid DNA Ligation Kit (Boehringer Mannheim) according to the supplier's 
specification. The ligated products were introduced into E. coli XL1-Blue by transformation according to the procedure 
of Chung, C. T. et al. (1989) Proa Natl. Acad. Sci. USA 86, 21 72-21 75. Transformants were plated on L-agar containing 

5 100 ng/ml ampicillin to allow selection of recombinants. Colonies were picked and grown in L-broth containing 100 
ng/ml ampicillin. Plasmid DNA was prepared with the High Pure Plasmid Isolation Kit (Boehringer Mannheim) accord- 
ing to the supplier's specification. The plasmids were screened for insertions by digestion with Ncol/BamHI. Strains 
containing the genes of interest were grown in L-broth supplemented with 100 jig/ml ampicillin and tested for the 
expression of DNA polymerase / reverse transcriptase activity by induction of exponentially growing culture with 1 mM 

w IPTG and assaying the heat-treated extracts (72 °C) for DNA polymerase / reverse transcriptase activity as described 
above (determination of DNA polymerase activity and determination of reverse transcriptase activity). 
[0039] The present invention provides improved methods for efficiently transcribing RNA and amplifying RNA or DNA. 
These improvements are achieved by the discovery and application of previously unknown properties of thermoactive 
DNA polymerases with reverse transcriptase activity. 

is [0040] The enzyme of this invention may be used for any purpose in which such enzyme activity is necessary or 
desired. In a particularly preferred embodiment, the enzyme catalyzes reverse transcription of RNA which is amplified 
as DNA by a second DNA polymerase present in the amplification reaction known as RT-PCR (Powell et al. (1 987) Cell 
50, 831-840). Any ribonucleic acid sequence, in purified or nonpurified form, can be utilized as the starting nucleic 
acid(s), provided it contains or is suspected to contain the specific nucleic acid sequence desired. The nucleic acid to 

20 be amplified can be obtained from any source, for example, from plasmids such as pBR322, from cloned RNA, from 
natural RNA from any source, including bacteria, yeast, viruses, organelles, and higher organisms such as plants and 
animals, or from preparations of nucleic acids made in vitro. 

[0041 ] RNA may be extracted from blood, tissue material such as chorionic villi, or amniotic cells by a variety of tech- 
niques. See, e.g., Maniatis et al., (1982) Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory, Cold 
25 Spring Harbor, New York) pp. 280-281. Thus the process may employ, for example, RNA, including messenger RNA, 
which RNA may be single-stranded or double-stranded. In addition, a DNA- RNA hybrid which contains one strand of 
each may be utilized. 

[0042] The amplification of target sequences from RNA may be performed to proof the presence of a particular 
sequence in the sample of nucleic acid to be analyzed or to clone a specific gene. A Chy DNA polymerase is very useful 
30 for these processes. Due to its 3-5' exonuclease activity it is able to synthesize products with higher accuracy as the 
reverse transcriptases of the state of the art. 

[0043] A Chy DNA polymerase may also be used to simplify and improve methods for detection of RNA target mole- 
cules in a sample. In these methods A Chy DNA polymerase from Carboxydothermus hydrogenoformans may catalyze: 
(a) reverse transcription and (b) second strand cDNA synthesis. The use of DNA polymerase from Carboxydothermus 

35 hydrogenoformans may be used to perform RNA reverse transcription and amplification of the resulting complementary 
DNA with enhanced specificity and with fewer steps than previous RNA cloning and diagnostic methods. 
[0044] Another aspect of the invention comprises a kit for performing RT-PCR comprising A Chy polymerase, reaction 
buffers, nucleotide mixtures, and optionally a thermostable DNA polymerase for detection and amplification of RNA 
either in a one step reaction or for reverse transcription of the template RNA and subsequent amplification of the cDNA 

40 product. 

Brief Description of the Drawings 
[0045] 

45 

Fi gure 1 shows the nucleic acid and amino acid sequence of the "Klenow fragment" of Chy polymerase designated 
A Chy. 

Fig. 2 shows the reverse transcriptase activity of A Chy in dependence of magnesium and manganese salt. 

50 

Figure 3 shows the reverse transcription and amplification of a 997 bp fragment of the p-Actin gene from total 
mouse liver RNA using A Chy and the Expand HiFi-System and decreasing amounts of RNA. 

Figure 4 shows the reverse transcription and amplification of a 997 bp fragment of p-actin from total mouse liver 
55 RNA in comparison to Tth polymerase. Reverse transcription was either coupled with amplification ("one tube") 

using the Expand HiFi-System from Boehringer Mannheim, or after reverse transcription the Expand HiFi-System 
from Boehringer Mannheim was added to the reaction mixture for the subsequent amplification reaction ("two 
tube"). 
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Figure 5 shows the reverse transcription and amplification of a 1,83 kb fragment of Dystrophin from total human 
muscle RNA. 

Fi gure 6 shows the reverse transcription and amplification of a 324 bp fragment of p-actin from total mouse liver 
5 RNA with various amounts of Chy polymerase and various incubation times. 

Figure 7 shows schematically the construction of the clone encoding A Chy from the clone encoding the wild type 
gene. 

10 [0046] The following examples describe the invention in greater detail: 
Example 1 

Reverse transcription of a 324 bp p-actin fragment with Chy wild type DNA Polymerase used as Reverse Transcriptase 
15 followed by PCR with Taq-polymerase (Figure 6). 

[0047] The reaction mixture (20 nl) contained 200 ng total mouse liver RNA, 200 ^M dNTR 10 mM Tris-HCI, pH 8.8, 
5 mM DTT, 10 mM 2-mercaptoethanol, 15 mM KCI, 4.5 mM MgCI 2 , 0.02 mg/ml BSA, 20 pmol of reverse primer (p-actin 
reverse: 5-AAT TCG GAT GGC TAC GTA CAT GGC TG-3 1 ) and Chy-polymerase 33 units (lanes 1, 4, 7, 10, 13, 16), 
20 13,2 units (lanes 2, 5, 8, 11, 14, 17) and 6,6 units (lanes 3, 6, 9, 12, 15, 18). Reactions were incubated for 5 min (lanes 

1 to 6), 10 min (lanes 7 to 12) and 15 min (lanes 13 to 18) at 70 °C. 

20 nl of the reverse transcription reaction was used as template for PCR (100 nl reaction volume) with Taq-polymerase 
(Boehringer Mannheim) according to the supplier's specification using 20 pmol of forward and reverse primer (Primer 
sequence "p-actin forward": 5'AGC TTG CTG TAT TCC CCT CCA TCG TG-3', primer sequence "p-actin reverse": 5'- 
25 AAT TCG GAT GGC TAC GTA CAT GGC TG-3 1 ) and 200 \M dNTP's. 
Amplification was carried out using the following temperature profile: 

2 min 94 °C; (10 sec 94 °C; 30 sec 60 °C; 30 sec 72 °C) x 30; 7 min 72 °C 

Example 2 

30 

Construction of the vector expressing A Chy 

[0048] The mutant was obtained by deletion of an N-terminal fragment of recombinant wild type Carboxydothermus 
hydrogenoformans DNA polymerase using inverse PCR (Innis et al., (1990) PCR Protocols; Academic Press, Inc., p 

35 219-227). The reverse primer used is complementary to the cloning site of the expression vector pDS56 (Stuber, D., 
Matile, H. and Garotta, G. (1990) Immunological Methods, Letkovcs, I. and Pernis, B., eds.) at the Nco I restriction site 
(bases 120-151) and has the sequence: 5-CGG TAA ACC CAT GGT TAA TTT CTC CTC TTT AAT GAA TTC-3\ This 
primer contains additional 7 bases at the 5' end to ensure a better binding of the Nco I restriction enzyme in the subse- 
quent restriction enzyme cleavage. The second (forward) primer, was complementary to bases 676-702 of the wild type 

40 gene (sequence: S'-CGG GAA TCC ATG GAA AAG CTT GCC GAA CAC GAA AAT TTA-3'). The forward primer also 
contained an additional Nco I restriction site and additional 7 bases at the 5'-end. Plasmid pDS56 DNA containing the 
polymerase-gene of Carboxydothermus hydrogenoformans at the Nco l/BamHI restriction sites was used as template 
for PCR. The PCR reaction was performed on circular plasmid DNA pAR4. The fragment of Carboxydothermus 
hydrogenoformans DNA polymerase (AChy) and the vector DNA were amplified as linear DNA by PCR using the 

45 Expand High Fidelity PCR System (Boehringer Mannheim) according to the supplier's specifications. The length of the 
gene encoding A Chy is 1821 bp. 

Amplification (Perkin Elmer Gene Amp 9600 thermocycler) was carried out with the following conditions: 
2 min 94 °C, (10 sec 94 °C; 30 sec 65 °C; 4 min 68 °C) x 10; (10 sec 94 °C; 30 sec 65 °C; 4 min 68 °C) + cycle elonga- 
tion of 20 sec for each cycle) x 20; 7 min 72 °C; 

so After PCR the amplified DNA was purified using the High Pure PCR Product Purification Kit (Boehringer Mannheim) 
and digested with Ncol (3U / ng DNA) for 16 h (Boehringer Mannheim) according to the supplier's specifications. 
For extraction with Phenol/Chloroform/lsoamylalcohol (24:24:1) the volume of the sample was raised to 100 nl with TE. 
After extraction the DNA was precipitated by adding 1/10 volumes of 3M Sodium Acetate, pH 5.2 and 2 volumes of 
EtOH. The DNA was circularized using the Rapid DNA Ligation Kit (Boehringer Mannheim) according to the supplier's 

55 specification. The ligated products were introduced into E. coii XL1-Blue by transformation according to the procedure 
of Chung, C. T. et al. (1 989) Proc. Natl. Acad. Sci. USA 86, 21 72-21 75. Transformants were plated on L-agar containing 
100 ng/ml ampicillin to allow selection of recombinants. Colonies were picked and grown in L-broth containing 100 
ng/ml ampicillin. Plasmid DNA was prepared with the High Pure Plasmid Isolation Kit (Boehringer Mannheim) accord- 
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ing to the supplier's specification. The plasmids were screened for insertions by digestion with Ncol/BamHI. Strains 
containing the genes of interest were grown in L-broth supplemented with 100 tig/mi ampicillin and tested for the 
expression of DNA polymerase / reverse transcriptase activity by induction of exponentially growing culture with 1 mM 
IPTG and assaying the heat-treated extracts (72 °C) for DNA polymerase / reverse transcriptase activity as described 
5 above (determination of DNA polymerase activity and determination of Reverse Transcriptase activity). (Figure 7) 

Example 3 

Reverse transcription and amplification of a 997 bp fragment of p-actin from total mouse liver RNA. Comparison of A 
10 Chy with Tth polymerase in the reverse transcription reaction (Figure 4) either in a coupled RT-PCR reaction ("one 
tube") or in consecutive steps, reverse transcription, addition of polymerase and amplification of the cDNA product of 
the first step. 

[0049] 

15 

"one tube" system: 

The reactions (50 nl) contained 1 0 mM Tris-HCI, pH 8.8 at 25 °C, 1 5 mM KCI, 2,5 mM MgCI 2 , 400 \M of each dNTP, 
decreasing amounts of mouse total RNA (Clonetech) as indicated in the figure, 300 nM of each primer, 60 units of 
A Chy and 3,5 units of the Expand HiFi enzyme mix (Boehringer Mannheim GmbH). All reactions were incubated 
20 at 60 °C for 30 min (RT step). Amplification followed immediately with following cycle profile (Perkin Elmer Gene- 
Amp 9600 thermocycler): 

30 sec. at 94 °C; (30 sec at 94 °C. 30 sec at 60 °C, 1 min. at 68 °C) x 10; (30 sec. at 94°C, 30 sec. at 60°C, 1 min. 
at 68°C + cyle elongation of 5 sec. for each cycle) x 20; 7 min at 68 °C; 
"two tube" system: 

25 Reverse transcription is performed in 10 mM Tris-HCI, pH 8.8, 15 mM (NH 4 ) 2 S04, 0.1 % Tween, 4,5 mM MgCI 2 , 2 
% DMSO, 800 nM dNTPs, 300 nmoles of each primer, 60 units of A Chy, various amounts of total mouse muscle 
RNA as indicated in the figure. The reaction was performed in a volume of 25 *il for 30 min at 60 °C. 

[0050] 5 \l\ of this reaction are used for the amplification with the Expand HiFi-system from Boehringer Mannheim. 
30 Amplification was performed with 2,6 units of polymerase mixture in a reaction volume of 25 The following tempera- 
ture cycling conditions were used: 30 sec. at 94°C, (30 sec. at 94°C, 30 sec at 60°C, 1 min at 68°C) x 10, (30 sec. at 
94°C, 30 sec. at 60°C, 1 min at 68°C + cycle elongation for 5 sec for each cycle) X 20. 

[0051 ] As a control reaction the same template-primer system was used for RT-PCR with Tth polymerase (Boehringer 
Mannheim). The reaction was set up according to the supplier's specifications for the "one step" variant. 

35 
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15 



20 



25 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Boehringer Mannheim GmbH 

(B) STREET: Sandhof erstr . 116 

(C) CITY: Mannheim 

(E) COUNTRY: DE 

(F) POSTAL CODE (ZIP) : 68305 

(G) TELEPHONE: 06217595482 

(H) TELEFAX: 06217594457 



(ii) TITLE OF INVENTION: Modified DNA- Polymerase from carboxydo- 
thermus hydrogenf ormans and its use for Coupled Reverse Transcrip- 
tion and Polymerase Chain Reaction 

(iii) NUMBER OF SEQUENCES: 12 

30 (iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

35 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

40 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



11 



EP 0 922 765 A1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
CCNAAYYTNC ARAAYATH 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
YTCRTCRTGN ACYTG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGG CGAAGAC GCTATATTCC TGAGC 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GAAGCCTTAA TTCAATCTGG GAATAATC 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CGAATTCAAT CCATGGGAAA AGTAGTCCTG GTGGAT 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



CGAATTCAAG GATCCTTACT TCGCTTCATA CCAGTT 



36 



w 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGGTAAACCC ATGGTTAATT TCTCCTCTTT AATGAATTC 39 

(2) INFORMATION FOR SEQ ID NO: 8: 



35 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 



40 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: other nucleic acid 



45 



(A) DESCRIPTION: 



/desc = "oligonucleotide" 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



50 



CGGGAATCCA TGGAAAAGCT TGCCGAACAC GAAAATTTA 



39 



55 
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w 



20 



25 



35 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
15 (A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AATTCGGATG GCTACGTACA TGGCTG 26 
(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1824 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



40 (ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . .1824 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATG GAA AAG CTT GCC GAA CAC GAA AAT TTA GCA AAA ATA TCG AAA CAA 48 
Met Glu Lys Leu Ala Glu His Glu Asn Leu Ala Lys lie Ser Lys Gin 
15 10 15 
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TTA GCT ACA ATC CTG CGG GAA ATA CCG TTA GAA ATC TCC CTG GAA GAT 96 
Leu Ala Thr He Leu Arg Glu He Pro Leu Glu He Ser Leu Glu Asp 
20 25 30 



10 



TTA AAA GTT AAA GAA CCT AAT TAT GAA GAA GTT GCT AAA TTA TTT CTT 144 
Leu Lys Val Lys Glu Pro Asn Tyr Glu Glu Val Ala Lys Leu Phe Leu 
35 40 45 



15 



CAC CTT GAG TTT AAA AGC TTT TTA AAA GAA ATA GAA CCA AAA ATA AAG 192 
His Leu Glu Phe Lys Ser Phe Leu Lys Glu He Glu Pro Lys lie Lys 
50 55 60 



20 



AAA GAA TAC CAG GAA GGT AAA GAT TTG GTG CAA GTT GAA ACT GTA GAA 24 0 
Lys Glu Tyr Gin Glu Gly Lys Asp Leu Val Gin Val Glu Thr Val Glu 
65 70 75 80 



25 



30 



35 



40 



ACG GAA GGA CAG ATT GCA GTA GTT TTT AGT GAT GGA TTT TAT GTT GAT 288 
Thr Glu Gly Gin He Ala Val Val Phe Ser Asp Gly Phe Tyr Val Asp 
85 90 95 

GAC GGG GAA AAA ACA AAG TTT TAC TCT TTA GAC CGG CTG AAT GAA ATA 336 
Asp Gly Glu Lys Thr Lys Phe Tyr Ser Leu Asp Arg Leu Asn Glu He 
100 105 110 

GAG GAA ATA TTT AGG AAT AAA AAA ATT ATT ACC GAC GAT GCC AAA GGA 384 
Glu Glu He Phe Arg Asn Lys Lys lie lie Thr Asp Asp Ala Lys Gly 
115 120 125 



45 



ATT TAT CAT GTC TGT TTA GAA AAA GGT CTG ACT TTT CCC GAA GTT TGT 432 
He Tyr His Val Cys Leu Glu Lys Gly Leu Thr Phe Pro Glu Val Cys 
130 135 140 



50 



TTT GAT GCG CGG ATT GCA GCT TAT GTT TTA AAC CCG GCC GAC CAA AAT 480 
Phe Asp Ala Arg lie Ala Ala Tyr Val Leu Asn Pro Ala Asp Gin Asn 
145 150 155 160 



55 



16 



EP0 922 765 A1 



CCC GGC CTC AAG GGG CTT TAT CTA AAG TAT GAC TTA CCG GTG TAT GAA 528 
Pro Gly Leu Lys Gly Leu Tyr Leu Lys Tyr Asp Leu Pro Val Tyr Glu 
165 170 175 



10 



GAT GTA TCT TTA AAC ATT AGA GGG TTG TTT TAT TTA AAA AAA GAA ATG 576 
Asp Val Ser Leu Asn lie Arg Gly Leu Phe Tyr Leu Lys Lys Glu Met 
180 185 190 



ATG AGA AAA ATC TTT GAG CAG GAG CAA GAA AGG TTA TTT TAT GAA ATA 624 
15 Met Arg Lys He Phe Glu Gin Glu Gin Glu Arg Leu Phe Tyr Glu He 

195 200 205 



20 



GAA CTT CCT TTA ACT CCA GTT CTT GCT CAA ATG GAG CAT ACC GGC ATT 672 
Glu Leu Pro Leu Thr Pro Val Leu Ala Gin Met Glu His Thr Gly He 
210 215 220 



25 



CAG GTT GAC CGG GAA GCT TTA AAA GAG ATG TCG TTA GAG CTG GGA GAG 720 
Gin Val Asp Arg Glu Ala Leu Lys Glu Met Ser Leu Glu Leu Gly Glu 
225 230 235 240 



30 



35 



40 



45 



50 



CAA ATT GAA GAG TTA ATC CGG GAA ATT TAT GTG CTG GCG GGG GAA GAG 768 
Gin He Glu Glu Leu He Arg Glu He Tyr Val Leu Ala Gly Glu Glu 
245 250 255 

TTT AAC TTA AAC TCG CCC AGG CAG CTG GGA GTT ATT CTT TTT GAA AAA 816 
Phe Asn Leu Asn Ser Pro Arg Gin Leu Gly Val He Leu Phe Glu Lys 
260 265 270 

CTT GGG CTG CCG GTA ATT AAA AAG ACC AAA ACG GGC TAC TCT ACC GAT 864 
Leu Gly Leu Pro Val He Lys Lys Thr Lys Thr Gly Tyr Ser Thr Asp 
275 280 285 

GCG GAG GTT TTG GAA GAG CTC TTG CCT TTC CAC GAA ATT ATC GGC AAA 912 
Ala Glu Val Leu Glu Glu Leu Leu Pro Phe His Glu He lie Gly Lys 
290 295 300 



ATA TTG AAT TAC CGG CAG CTT ATG AAG TTA AAA TCC ACT TAT ACT GAC 960 
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10 



15 



25 



35 



40 



lie Leu Asn Tyr Arg Gin Leu Met Lys Leu Lys Ser Thr Tyr Thr Asp 
305 310 315 320 

GGC TTA ATG CCT TTA ATA AAT GAG CGT ACC GGT AAA CTT CAC ACT ACT 1008 
Gly Leu Met Pro Leu He Asn Glu Arg Thr Gly Lys Leu His Thr Thr 
325 330 335 

TTT AAC CAG ACC GGT ACT TTA ACC GGA CGC CTG GCG TCT TCG GAG CCC 1056 
Phe Asn Gin Thr Gly Thr Leu Thr Gly Arg Leu Ala Ser Ser Glu Pro 
340 345 350 



AAT CTC CAA AAT ATT CCC ATC CGG TTG GAA CTC GGT CGG AAA TTA GGC 1104 
20 Asn Leu Gin Asn He Pro He Arg Leu Glu Leu Gly Arg Lys Leu Arg 
355 360 365 



AAG ATG TTT ATA CCT TCA CCG GGG TAT GAT TAT ATT GTT TCG GCG GAT 1152 
Lys Met Phe He Pro Ser Pro Gly Tyr Asp Tyr He Val Ser Ala Asp 
370 375 380 

30 TAT TCC CAG ATT GAA TTA AGG CTT CTT GCC CAT TTT TCC GAA GAG CCC 1200 
Tyr Ser Gin He Glu Leu Arg Leu Leu Ala His Phe Ser Glu Glu Pro 
385 390 395 400 



50 



AAG CTT ATT GAA GCT TAC CAA AAA GGG GAG GAT ATT CAC CGG AAA ACG 124 8 
Lys Leu He Glu Ala Tyr Gin Lys Gly Glu Asp lie His Arg Lys Thr 
405 410 415 

GCC TCC GAG GTG TTC GGT GTA TCT TTG GAA GAA GTT ACT CCC GAG ATG 1296 
Ala Ser Glu Val Phe Gly Val Ser Leu Glu Glu Val Thr Pro Glu Met 
420 425 430 

CGC GCT CAT GCC AAG TCG GTG AAC TTC GGC ATT GTT TAT GGC ATT AGT 1344 
Arg Ala His Ala Lys Ser Val Asn Phe Gly He Val Tyr Gly He Ser 
435 440 445 
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10 



15 



20 



25 



30 



35 



GAT TTT GGT TTA GGC AGA GAC TTA AAG ATT CCC CGG GAG GTT GCC GGT 1392 
Asp Phe Gly Leu Gly Arg Asp Leu Lys lie Pro Arg Glu Val Ala Gly 
450 455 460 

AAG TAG ATT AAA AAT TAT TTT GCC AAC TAT CCC AAA GTG CGG GAG TAT 1440 
Lys Tyr He Lys Asn Tyr Phe Ala Asn Tyr Pro Lys Val Arg Glu Tyr 
465 470 475 480 

CTC GAT GAA CTT GTC CGT ACG GCA AGA GAA AAG GGA TAT GTG ACC ACT 1488 
Leu Asp Glu Leu Val Arg Thr Ala Arg Glu Lys Gly Tyr Val Thr Thr 
485 490 495 

TTA TTT GGG CGA AGA CGC TAT ATT CCT GAG CTA TCT TCA AAA AAC CGC 1536 
Leu Phe Gly Arg Arg Arg Tyr He Pro Glu Leu Ser Ser Lys Asn Arg 
500 505 510 

ACG GTT CAG GGT TTT GGC GAA AGG ACG GCC ATG AAT ACT CCC CTT CAG 1584 
Thr Val Gin Gly Phe Gly Glu Arg Thr Ala Met Asn Thr Pro Leu Gin 
515 520 525 

GGC TCG GCT GCC GAT ATT ATT AAG CTT GCA ATG ATT AAT GTA GAA AAA 1632 
Gly Ser Ala Ala Asp He He Lys Leu Ala Met He Asn Val Glu Lys 
530 535 540 



GAA CTT AAA GCC CGT AAG CTT AAG TCC CGG CTC CTT CTT TCG GTG CAC 1680 
Glu Leu Lys Ala Arg Lys Leu Lys Ser Arg Leu Leu Leu Ser Val His 
<o 545 550 555 560 

GAT GAG TTA GTT TTA GAA GTG CCG GCG GAA GAG CTG GAA GAG GTA AAA 1728 
Asp Glu Leu Val Leu Glu Val Pro Ala Glu Glu Leu Glu Glu Val Lys 
565 570 575 



50 



GCG CTG GTA AAA GGG GTT ATG GAG TCG GTG GTT GAA CTG AAA GTG CCT 1776 
Ala Leu Val Lys Gly Val Met Glu Ser Val Val Glu Leu Lys Val Pro 
580 585 590 
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20 
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30 



40 



45 



50 



55 



TTA ATC GCT GAA GTT GGT GCA GGC AAA AAC TGG TAT GAA GCG AAG TAA 1824 
Leu lie Ala Glu Val Gly Ala Gly Lys Asn Trp Tyr Glu Ala Lys * 
595 600 605 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Glu Lys Leu Ala Glu His Glu Asn Leu Ala Lys lie Ser Lys Gin 
15 10 15 

Leu Ala Thr lie Leu Arg Glu He Pro Leu Glu He Ser Leu Glu Asp 
20 25 30 

Leu Lys Val Lys Glu Pro Asn Tyr Glu Glu Val Ala Lys Leu Phe Leu 
35 40 45 



His Leu Glu Phe Lys Ser Phe Leu Lys Glu He Glu Pro Lys He Lys 

35 50 55 60 

Lys Glu Tyr Gin Glu Gly Lys Asp Leu Val Gin Val Glu Thr Val Glu 

65 70 75 80 



Thr Glu Gly Gin He Ala Val Val Phe Ser Asp Gly Phe Tyr Val Asp 
85 90 95 

Asp Gly Glu Lys Thr Lys Phe Tyr Ser Leu Asp Arg Leu Asn Glu He 
100 105 110 

Glu Glu He Phe Arg Asn Lys Lys He He Thr Asp Asp Ala Lys Gly 
115 120 125 
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lie Tyr His Val Cys Leu Glu Lys Gly Leu Thr Phe Pro Glu Val Cys 
130 135 140 

5 

Phe Asp Ala Arg lie Ala Ala Tyr Val Leu Asn Pro Ala Asp Gin Asn 
145 150 155 160 

10 

Pro Gly Leu Lys Gly Leu Tyr Leu Lys Tyr Asp Leu Pro Val Tyr Glu 
165 170 175 

15 Asp Val Ser Leu Asn lie Arg Gly Leu Phe Tyr Leu Lys Lys Glu Met 

180 185 190 

20 Met Arg Lys He Phe Glu Gin Glu Gin Glu Arg Leu Phe Tyr Glu He 

195 200 205 



Glu Leu Pro Leu Thr Pro Val Leu Ala Gin Met Glu His Thr Gly He 
25 210 215 220 

Gin Val Asp Arg Glu Ala Leu Lys Glu Met Ser Leu Glu Leu Gly Glu 
30 225 230 235 240 



35 



Gin He Glu Glu Leu He Arg Glu lie Tyr Val Leu Ala Gly Glu Glu 
245 250 255 



40 



45 



Phe Asn Leu Asn Ser Pro Arg Gin Leu Gly Val He Leu Phe Glu Lys 
260 265 270 

Leu Gly Leu Pro Val He Lys Lys Thr Lys Thr Gly Tyr Ser Thr Asp 
275 280 285 

Ala Glu Val Leu Glu Glu Leu Leu Pro Phe His Glu lie lie Gly Lys 
290 295 300 
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lie Leu Asn Tyr Arg Gin Leu Met Lys Leu Lys Ser Thr Tyr Thr Asp 
305 310 315 320 
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10 



Gly Leu Met Pro Leu lie Asn Glu Arg Thr Gly Lys Leu His Thr Thr 
325 330 335 

Phe Asn Gin Thr Gly Thr Leu Thr Gly Arg Leu Ala Ser Ser Glu Pro 
340 345 350 

Asn Leu Gin Asn lie Pro lie Arg Leu Glu Leu Gly Arg Lys Leu Arg 
355 360 365 



15 



Lys Met Phe lie Pro Ser Pro Gly Tyr Asp Tyr lie Val Ser Ala Asp 
370 375 380 



20 



Tyr Ser Gin lie Glu Leu Arg Leu Leu Ala His Phe Ser Glu Glu Pro 
385 390 395 400 



25 



Lys Leu lie Glu Ala Tyr Gin Lys Gly Glu Asp lie His Arg Lys Thr 
405 410 415 



30 



Ala Ser Glu Val Phe Gly Val Ser Leu Glu Glu Val Thr Pro Glu Met 
420 425 430 



35 



40 



Arg Ala His Ala Lys Ser Val Asn Phe Gly lie Val Tyr Gly lie Ser 
435 440 445 

Asp Phe Gly Leu Gly Arg Asp Leu Lys lie Pro Arg Glu Val Ala Gly 
450 455 460 

Lys Tyr lie Lys Asn Tyr Phe Ala Asn Tyr Pro Lys Val Arg Glu Tyr 
465 470 475 480 



45 



Leu Asp Glu Leu Val Arg Thr Ala Arg Glu Lys Gly Tyr Val Thr Thr 
485 490 495 
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Leu Phe Gly Arg Arg Arg Tyr lie Pro Glu Leu Ser Ser Lys Asn Arg 
500 505 510 
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Thr Val Gin Gly Phe Gly Glu Arg Thr Ala Met Asn Thr Pro Leu Gin 
515 520 525 

Gly Ser Ala Ala Asp lie He Lys Leu Ala Met He Asn Val Glu Lys 
530 535 540 

Glu Leu Lys Ala Arg Lys Leu Lys Ser Arg Leu Leu Leu Ser Val His 
545 550 555 560 

Asp Glu Leu Val Leu Glu Val Pro Ala Glu Glu Leu Glu Glu Val Lys 
565 570 575 



20 Ala Leu Val Lys Gly Val Met Glu Ser Val Val Glu Leu Lys Val Pro 
580 585 590 
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35 



40 



45 



50 



Leu He Ala Glu Val Gly Ala Gly Lys Asn Trp Tyr Glu Ala Lys 
595 600 605 



30 (2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

AGCTTGCTGT ATTCCCCTCC ATCGTG 26 



55 Claims 



1 . A purified DNA polymerase exhibiting reverse transcriptase activity in the presence of magnesium ions and/or man- 
ganese ions having reduced or no 5'-3'-exonuclease activity and substantially no RNaseH activity and obtainable 
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from Carboxydothermus hydrogenoformans. 

2. A DNA polymerase as claimed in claim 1 wherein said DNA polymerase exhibits reverse transcriptase activity in 
the substantial absence of manganese ions. 

3. A DNA polymerase as claimed in claim 1 or 2 wherein said polymerase exhibits a reverse transcriptase activity 
which is manganese-dependent. 

4. A DNA polymerase according to claims 1 - 3 , wherein the magnesium dependent reverse transcriptase activity is 
higher than the manganese dependent reverse transcriptase activity of said polymerase. 

5. A DNA polymerase as claimed in any of claims 1-4, wherein said polymerase is a mutant with reduced or no 5'- 
3 'exonuclease activity derived from a naturally occurring polymerase possessing 5 '-3'exonuclease activity. 

6. A DNA polymerase as claimed in any of claims 1-5, wherein said polymerase has an apparent molecular weight 
between about 64 to 71 kDa as determined by SDS polyacrylamide electrophoresis. 

7. A recombinant DNA polymerase as claimed in any of claims 1-6, wherein said polymerase is obtainable from E. 
coli. the strain being designated E.coli GA1. 

8. An isolated DNA sequence coding for the polymerase as claimed in any one of claims 1 -7. 

9. A recombinant DNA sequence capable of encoding a DNA polymerase as claimed in any one of claims 1 -7. 

10. An isolated DNA sequence represented by the formula shown in SEQ ID No. 10. 

11. A vector containing the isolated DNA sequence as claimed in any of claims 8-10. 

12. A vector according to claim 11, wherein such vector is plasmid pDS56 carrying a deletion mutant of the Carboxy- 
dothermus hydrogenoformans DNA polymerase gene and is then designated PA2.225AR4. 

13. The vector according to claim 1 1 providing some or all of the following features: 

(1) promotors or sites of initiation of transcription 

(2) operators which could be used to turn gene expression on or off 

(3) ribosome binding sites for improved translation 

(4) transcription or translation termination sites 

14. A microbial host transformed with the vector of claims 11-13. 

15. A microbial host according to claim 14 wherein said transformant is E. coli, the strain being designated E.coli GA1. 

16. A process for the preparation of DNA polymerase according to any of the claims 1-7 comprising the steps: 

(a) culturing the natural strain Carboxydothermus hydrogenoformans 

(b) suspending the cells of the natural strain in buffer 

(c) disrupting the cells 

(d) purifying the DNA polymerase by chromatographic steps including the use of one or more Sepharose-col- 
umns. 

1 7. A process for the preparation of DNA polymerase according to any one of claims 1 -7 comprising growing a recom- 
binant E. coli strain transformed with a vector according to claims 11-13 and purifying and isolating the DNA 
polymerase. 

1 8. A process of amplifying RNA, characterized in that a thermophilic DNA polymerase as claimed in any one of claims 
1 -7 is used in combination with a thermostable DNA polymerase. 

1 9. A process for cDN A cloning and DNA sequencing, characterized in that a thermophilic DNA polymerase as claimed 
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in any one of claims 1-7 is used. 

20. A process for DNA labeling, characterized in that a thermophilic DNA polymerase as claimed in any one of claims 
1-7 is used. 

21. A process for reverse transcription of RNA to cDNA characterized in that a thermophilic DNA polymerase as 
claimed in any one of claims 1 -7 is used. 

22. A kit useful for RT-PCR comprising reverse transcription of RNA using a thermophilic DNA polymerase as claimed 
in any one of claims 1 -7 and amplification of the cDNA product by a thermostable DNA polymerase either in a com- 
bined reaction (RT and PCR) or for consecutive reactions (RT and subsequently PCR). 
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Figure I : 

Nucleic acid (SEQ ID NO: 10) and protein (SEQ ID NO: 11) sequence of A Chy DNA 
polymerase 

1 ATGGAAAAGCTTGCCGAACACGAAAATTTAGCAAAAATATCGAAACAATTAGCTACAATC 

1 MEKLAEHENLAKISKQLATI 

61 CTGCGGGAAATACCGTTAGAAATCTCCCTGGAAGATTTAAAAGTTAAAGAACCTAATTAT 

21 LREIPLEISLEDLKVKEPNY 

121 GAAGAAGTTGCTAAATTATTTCTTCACCTTGAGTTTAAAAGCTTTTTAAAAGAAATAGAA 

41 EE V A K L FLHLEFKSFLKEIE 

181 CCAAAAATAAAGAAAGAATACCAGGAAGGTAAAGATTTGGTGCAAGTTGAAACTGTAGAA 

61 PKIKKEYQEGKDLVQVETVE 

241 ACGGAAGGACAGATTGCAGTAGTTTTTAGTGATGGATTTTATGTTGATGACGGGGAAAAA 

81 TEGQIAVVFSDGFYVDDGEK 

301 ACAAAGTTTTACTCTTTAGACCGGCTGAATGAAATAGAGGAAATATTTAGGAATAAAAAA 

101 TKFY SLDRLNEIEEIFRNKK 

361 ATTATTACCGACGATGCCAAAGGAATTTATCATGTCTGTTTAGAAAAAGGTCTGACTTTT 

121 II TDDAKGIYHVCLEKGL T F 

421 CCCGAAGTTTGTTTTGATGCGCGGATTGCAGCTTATGTTTTAAACCCGGCCGACCAAAAT 

141 PEVCFDARIAAYVLNPADQN 

481 CCCGGCCTCAAGGGGCTTTATCTAAAGTATGACTTACCGGTGTATGAAGATGTATCTTTA 

161 PGLKGLYLKYDLPVYEDVSL 

541 AACATTAGAGGGTTGTTTTATTTAAAAAAAGAAATGATGAGAAAAATCTTTGAGCAGGAG 

181 N I R G L FYLKKEMMRKIFEQE 
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601 CAAGAAAGGTTATTTTATGAAATAGAACTTCCTTTAACTCCAGTTCTTGCTCAAATGGAG 

201 QERLFYEIELPL TPVLAQME 

661 CATACCGGCATTCAGGTTGACCGGGAAGCTTTAAAAGAGATGTCGTTAGAGCTGGGAGAG 

221 HTGIQVDREALKEMSLELGE 

721 CAAATTGAAGAGTTAATCCGGGAAATTTATGTGCTGGCGGGGGAAGAGTTTAACTTAAAC 

241 QIEELIREIYVLAGEEFNLN 

781 TCGCCCAGGCAGCTGGGAGTTATTCTTTTTGAAAAACTTGGGCTGCCGGTAATTAAAAAG 

261 SPRQLGVILFEKLGLPVIKK 

* 

841 ACCAAAACGGGCTACTCTACCGATGCGGAGGTTTTGGAAGAGCTCTTGCCTTTCCACGAA 

281 TKTGYSTDAEVLEELLPFHE 

901 ATTATCGGCAAAATATTGAATTACCGGCAGCTTATGAAGTTAAAATCCACTTATACTGAC 

301 IIGKILNYRQLMKLKSTYTD 

961 GGCTTAATGCCTTTAATAAATGAGCGTACCGGTAAACTTCACACTACTTTTAACCAGACC 

321 GLMPLINERTGKLHTTFNQT 

1021 GGTACTTTAACCGGACGCCTGGCGTCTTCGGAGCCCAATCTCCAAAATATTCCCATCCGG 

341 GTLTGRLASSEPNLQNIPIR 

1081 TTGGAACTCGGTCGGAAATTACGCAAGATGTTTATACCTTCACCGGGGTATGATTATATT 

361 LELGRKLRKMFIPSPGYDYI 

1141 GTTTCGGCGGATTATTCCCAGATTGAATTAAGGCTTCTTGCCCATTTTTCCGAAGAGCCC 

381 VSADYSQIELRLLAHF SEEP 

1201 AAGCTTATTGAAGCTTACCAAAAAGGGGAGGATATTCACCGGAAAACGGCCTCCGAGGTG 

401 K L I E A YQKGEDIHRKTASEV 
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1261 TTCGGTGTATCTTTGGAAGAAGTTACTCCCGAGATGCGCGCTCATGCCAAGTCGGTGAAC 

421 FGVSLEEVTPEMRAHAKSVN 

1321 TTCGGCATTGTTTATGGCATTAGTGATTTTGGTTTAGGCAGAGACTTAAAGATTCCCCGG 

441 FGIVYGISDFGLGRDLKIPR 

1381 GAGGTTGCCGGTAAGTACATTAAAAATTATTTTGCCAACTATCCCAAAGTGCGGGAGTAT 

461 E VAGKYIKNYFANYPKVRE Y 

1441 CTCGATGAACTTGTCCGTACGGCAAGAGAAAAGGGATATGTGACCACTTTATTTGGGCGA 

481 L D E L V R TAREKGYVTTLFGR 

1501 AGACGCTATATTCCTGAGCTATCTTCAAAAAACCGCACGGTTCAGGGTTTTGGCGAAAGG 

501 R R YIPELSSKNR TVQGFGER 

1561 ACGGCCATGAATACTCCCCTTCAGGGCTCGGCTGCCGATATTATTAAGCTTGCAATGATT 

521 TAMNTPLQGSAADI I K L A M I 

1621 AATGTAGAAAAAGAACTTAAAGCCCGTAAGCTTAAGTCCCGGCTCCTTCTTTCGGTGCAC 

541 NVEKELKARKLKSRLLLS V H 

1681 GATGAGTTAGTTTTAGAAGTGCCGGCGGAAGAGCTGGAAGAGGTAAAAGCGCTGGTAAAA 

561 DELVLEVPAEELEEVKALVK 

1741 GGGGTTATGGAGTCGGTGGTTGAACTGAAAGTGCCTTTAATCGCTGAAGTTGGTGCAGGC 

581 GVMESVVELKVPLIAEVGAG 

1801 AAAAACTGGTATGAAGCGAAGTAA 

601 KNWYE AK* 
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Figure 2: 
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Fiuure 4: 
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Figure 7: 
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